11:23
2026-06-17
cryptobriefing.com
large-language-models
Stanford researcher releases SEFD dataset for machine-readable SEC filings
Stanford researchers released the Stanford EDGAR Filings Dataset (SEFD), a 152 billion token reconstruction of SEC EDGAR filings from 1994 to present in a layout-faithful MultiMarkdown format, achieviβ¦